Method for voice calling, method for voice playing and devices thereof

ABSTRACT

This disclosure describes a method for voice calling, voice playing, and devices thereof, in the field of communication. The method for voice calling includes: detecting whether a sensed distance value of an on-site terminal is less than a preset threshold, entering a predetermined call mode if the sensed distance value is less than the preset threshold, auto-playing a voice signal of an opposite terminal in the predetermined call mode, and if a voice signal of the on-site terminal is detected in the predetermined call mode, collecting the voice signal of the on-site terminal, and transmitting the voice signal of the on-site terminal to the opposite terminal

This application is a National Phase Application of InternationalApplication No. PCT/CN2014/082916, filed July 24, 2014, which is basedupon and claims priority to Chinese Patent Application No.201410131152.5, filed Apr. 2, 2014, the entire contents of which areincorporated herein by reference.

TECHNICAL FIELD

The present disclosure relates to the field of communication, and moreparticularly, to a method for voice calling, a method for voice playingand devices thereof.

BACKGROUND

Currently, domestic operating agencies do not permit Internet serviceproviders to provide two-way voice call service, such as network phoneservice, in China, but allow Internet service providers to provideone-way voice call service, such as network interphone service, inChina.

The function of network interphone may be provided by social applicationclients in terminals. During a voice call, a user “A” presses anintercom button on an interface of a social application client “A,” thenissues a voice of call; the social application client “A” records thevoice signal during the press of the intercom button, and then transmitsthe voice signal to a social application client “B” after the recordingis completed. After receiving the voice signal, the social applicationclient “B” displays an unread voice signal on its interface, a user “B”clicks the unread voice signal on the interface of the socialapplication terminal “B,” and then the social application client “B”plays the voice signal through a speaker.

During the implementation of the present disclosure, the inventor findsat least the following defects existing in the above-mentioned manner:when transmitting the voice signal, the user must press the intercombutton continuously. When playing the voice signal, the user also needsto click the unread voice signal, and when it is inconvenient for a userto perform a press operation or a click operation on a terminal, theuser is unable to use the network interphone service smoothly.

SUMMARY

In order to solve, in part, the problem that the user can not smoothlyuse the network interphone service when it is inconvenient for the userto perform a press operation or a click operation on the terminal, thepresent disclosure provides a method for voice calling, a method forvoice playing and devices thereof. The technical solutions are asfollows.

According to a first aspect of the present disclosure, there is provideda method for voice calling, the method includes:

detecting whether a sensed distance value of an on-site terminal issmaller than a preset threshold;

entering a predetermined call mode if it is detected that the senseddistance value is smaller than the preset threshold;

auto-playing a voice signal of an opposite terminal in the predeterminedcall mode; and

when a voice signal of the on-site terminal is detected in thepredetermined call mode, collecting the voice signal of the on-siteterminal, and transmitting the voice signal of the on-site terminal tothe opposite terminal.

According to a second aspect of the embodiments of the presentdisclosure, there is provided a method for voice playing, the methodincludes:

detecting whether a sensed distance value of an on-site terminal issmaller than a preset threshold;

if it is detected that the sensed distance value is smaller than thepreset threshold, entering a predetermined play mode; and

auto-playing a voice signal of an opposite terminal in the predeterminedplay mode.

According to a third aspect of the embodiments of the presentdisclosure, there is provided a device for voice calling, the deviceincludes:

a processor;

a memory storing instructions executable by the processor;

wherein the processor is configured to:

detect whether a sensed distance value of an on-site terminal is smallerthan a preset threshold;

enter a predetermined call mode if it is detected that the senseddistance value is smaller than the preset threshold;

auto-play a voice signal of an opposite terminal in the predeterminedcall mode; and

if the voice signal of the on-site terminal is detected in thepredetermined call mode, collect the voice signal of the on-siteterminal, and transmit the voice signal of the on-site terminal to theopposite terminal.

According to a fourth aspect of the embodiments of the presentdisclosure, there is provided a device for voice playing, the deviceincludes:

a processor;

a memory storing instructions executable by the processor;

wherein the processor is configured to:

detect whether a sensed distance value of an on-site terminal is smallerthan a preset threshold;

enter a predetermined play mode when it is detected that the senseddistance value is smaller than the preset threshold; and

auto-play a voice signal of an opposite terminal in the predeterminedplay mode.

According to a fifth aspect of the embodiments of the presentdisclosure, there is provided a non-transitory readable storage mediumincluding instructions, executable by a processor in a terminal, forperforming a method for voice calling, the method including:

detecting whether a sensed distance value of the terminal is smallerthan a preset threshold;

entering a predetermined call mode if it is detected that the senseddistance value is smaller than the preset threshold;

auto-playing a voice signal of an opposite terminal in the predeterminedcall mode; and

when a voice signal of the terminal is detected in the predeterminedcall mode, collecting the voice signal of the terminal, and transmittingthe voice signal of the terminal to the opposite terminal.

According to a sixth aspect of the embodiments of the presentdisclosure, there is provided a non-transitory readable storage mediumincluding instructions, executable by a processor in a terminal, forperforming a method for voice calling, the method including:

detecting whether a sensed distance value of the terminal is smallerthan a preset threshold;

if it is detected that the sensed distance value is smaller than thepreset threshold, entering a predetermined play mode; and

auto-playing a voice signal of an opposite terminal in the predeterminedplay mode.

The technical schemes provided by the embodiments of the presentdisclosure have, in part, the following beneficial effects:

By entering the predetermined call mode when detecting that the senseddistance value is smaller than the preset threshold; auto-playing thevoice signal of the opposite terminal in the predetermined call mode;and when the voice signal of the on-site terminal is detected in thepredetermined call mode, collecting the voice signal of the on-siteterminal, and transmitting the voice signal of the on-site terminal tothe opposite terminal, the problem that the user can not smoothly usethe network interphone services when it is inconvenient for the user toperform a press operation or a click operation on the terminal issolved, the effects that voice playing and voice collecting of thenetwork interphone can be automatically triggered as long as the userputs the terminal close to his ear, and the network interphone can beused without any other operation are achieved, and a user experience ofan approximate two-way conversation can be obtained.

It is to be understood that both the foregoing general description andthe following detailed description are exemplary and explanatory onlyand are not restrictive of the invention, as claimed.

BRIEF DESCRIPTION OF THE DRAWINGS

The accompanying drawings, which are incorporated in and constitute apart of this specification, illustrate embodiments consistent with theinvention and, together with the description, serve to explain theprinciples of the invention.

FIG. 1 is a block diagram illustrating an implementation environmentrelated to the method for voice calling shown in respective exemplaryembodiments of the present disclosure;

FIG. 2 is a flow chart illustrating the method for voice callingaccording to an exemplary embodiment;

FIG. 3A is a flow chart illustrating the method for voice callingaccording to another exemplary embodiment;

FIG. 3B is a flow chart illustrating one way of performing a step 303 inthe embodiment shown in FIG. 3A;

FIG. 4 is a flow chart illustrating the method for voice playingaccording to an exemplary embodiment;

FIG. 5 is a flow chart illustrating the method for voice playingaccording to another exemplary embodiment;

FIG. 6 is a block diagram illustrating the device for voice callingaccording to an exemplary embodiment;

FIG. 7 is a block diagram illustrating the device for voice callingaccording to another exemplary embodiment;

FIG. 8 is a block diagram illustrating the device for voice playingaccording to an exemplary embodiment;

FIG. 9 is a block diagram illustrating the device for voice playingaccording to another exemplary embodiment; and

FIG. 10 is a block diagram illustrating the terminal according to anexemplary embodiment of the present disclosure.

DETAILED DESCRIPTION

In order to make the purposes, the technical solutions and theadvantages of the present disclosure more apparent, the presentdisclosure will be further described in detail below with reference tothe accompanying drawings. It is apparent that the described embodimentsare only a part of embodiments of the present disclosure, but not allthe embodiments. All the other embodiments obtained by the personskilled in the art without inventive labor belong to the protectionscope of the present disclosure.

FIG. 1 is a block diagram showing an implementation environment relatedto the method for voice calling shown in respective exemplaryembodiments of the present disclosure. The implementation environmentincludes an on-site terminal 120 and an opposite terminal 140.

The on-site terminal 120 may be a mobile phone, a tablet PC, an e-bookreader, a MP3 (Moving Picture Experts Group Audio Layer III) player, aMP4 (Moving Picture Experts Group Audio Layer IV) player, etc. A client,which is a client with a one-way voice call function such as networkinterphone function, runs in the on-site terminal 120. A distance sensorfor collecting the sensed distance value in front of a screen of theon-site terminal 120 is also provided in the on-site terminal 120.

The on-site terminal 120 may make a one-way voice call with the oppositeterminal 140 via a wired network or a wireless network.

The opposite terminal 140 may be a mobile phone, a tablet PC, an e-bookreader, a MP3 (Moving Picture Experts Group Audio Layer III) player, aMP4 (Moving Picture Experts Group Audio Layer IV) player, etc. A client,which is a client with a one-way voice call function such as networkinterphone function, also runs in the opposite terminal 140. A distancesensor for collecting the sensed distance value in front of a screen ofthe opposite terminal 140 is also provided in the opposite terminal 140.

The above-mentioned client may be a social client having both theinstant messaging function and the one-way voice call function, similarto WeChat client provided by Tencent Technology Co., Ltd., China,Mi-talk client provided by Xiaomi Inc., China, Laiwang client providedby Alibaba company, China, and Line client provided by South KoreaInternet Group NHN.

FIG. 2 is a flow chart illustrating the method for voice callingaccording to an exemplary embodiment. This embodiment is illustrated byapplying the method for voice calling to the on-site terminal in FIG. 1.The method for voice calling includes the following steps.

In step 201, whether a sensed distance value of an on-site terminal issmaller than a preset threshold is detected.

In step 202, a predetermined call mode is entered if it is detected thatthe sensed distance value is smaller than the preset threshold.

In step 203, in the predetermined call mode, a voice signal of anopposite terminal is auto-played.

In step 204, if a voice signal of the on-site terminal is detected inthe predetermined call mode, the voice signal of the on-site terminal iscollected, and the voice signal of the on-site terminal is transmittedto the opposite terminal.

In summary, in the method for voice calling provided by the presentembodiment, by entering the predetermined call mode when detecting thatthe sensed distance value is smaller than the preset threshold;auto-playing the voice signal of the opposite terminal in thepredetermined call mode; and when the voice signal of the on-siteterminal is detected in the predetermined call mode, collecting thevoice signal of the on-site terminal, and transmitting the voice signalof the on-site terminal to the opposite terminal, the problem that theuser can not smoothly use the network interphone when it is inconvenientfor the user to perform a press operation or a click operation on theterminal is solved, the effects that voice playing and voice collectingof the network interphone can be automatically triggered as long as theuser puts the terminal close to his ear, and the network interphone canbe used without any other operation are achieved, and a user experienceof an approximate two-way conversation can be obtained.

FIG. 3A is a flow chart illustrating the method for voice callingaccording to another exemplary embodiment. This embodiment isillustrated by applying the method for voice calling to the on-siteterminal in FIG. 1. The method for voice calling includes the followingsteps.

In step 301, whether a sensed distance value of an on-site terminal issmaller than a preset threshold is detected.

The on-site terminal obtains the sensed distance value in front of theterminal through a built-in distance sensor. The distance sensor isusually provided on a location at the top of a screen of the on-siteterminal and near a handset.

When a client with a one-way voice call function in the on-site terminalis operating, the on-site terminal detects whether the sensed distancevalue is smaller than the preset threshold every preset time interval.The preset threshold may be set by research personnel in accordance withexperimental values or empirical values. For example, if the on-siteterminal is a mobile phone, the research personnel may obtain the presetthreshold via statistical analysis by repeatedly collecting the senseddistance values between the mobile phone and the user's ear during thephone call.

In step 302, if it is detected that the sensed distance value is smallerthan the preset threshold, a predetermined call mode is entered.

If the on-site terminal detects that the sensed distance value issmaller than the preset threshold, the on-site terminal is considered tobe put on a position close to the user's ear, and the on-site terminalenters the predetermined call mode. The predetermined call mode is amode provided by the present embodiment that is able to properly use theone-way call function without clicking an operation signal on the screenof the terminal by the user.

If the on-site terminal detects that the sensed distance value isgreater than the preset threshold, then the on-site terminal isconsidered to be not put on the position close to the user's ear, andthe on-site terminal keeps in an ordinary call mode, i.e., the usagemode corresponding to the network interphone provided in the Background.

In step 303, when a voice signal of the on-site terminal is detected inthe predetermined call mode, the voice signal of the on-site terminal iscollected, and the voice signal of the on-site terminal is transmittedto the opposite terminal.

In the predetermined call mode, the on-site terminal automaticallydetects whether the voice signal of the on-site terminal exists, and ifthe user issues a voice, the on-site terminal may detect the voicesignal of the on-site terminal, then automatically collect the voicesignal of the on-site terminal, and send the voice signal of the on-siteterminal to the opposite terminal by using the one-way voice callfunction. This process does not require the user's other operations, andis completed automatically by the on-site terminal.

In one possible way of implementation, if the on-site terminal isplaying a voice signal of the opposite terminal, and the voice signal ofthe on-site terminal is detected, the on-site terminal still collectsthe voice signal of the on-site terminal, and transmits the voice signalof the on-site terminal to the opposite terminal Namely, whether theon-site terminal is playing the voice signal of the opposite terminalhas no influence on the collection of the voice signal of the on-siteterminal.

In step 304, the voice signal of the opposite terminal is playeddirectly in the predetermined call mode.

In the predetermined call mode, the on-site terminal directly uses thehandset to play the voice signal of the opposite terminal, without theuser's other operations. The voice signal of the opposite terminal is anetwork voice signal sent by the opposite terminal by using the one-wayvoice call function, other than the voice signal sent by using thetwo-way voice call function or the voice signal sent by using theconventional mobile call.

There are two ways to implement this step.

First, in the predetermined call mode, the voice signal of the oppositeterminal that has been received but not yet played is auto-played.

Namely, before entering the predetermined call mode, the on-siteterminal has received the voice signal of the opposite terminal, anddisplayed it as an unread voice signal of the opposite terminalThereafter, the on-site terminal enters the predetermined call mode, andthus the on-site terminal auto-plays the voice signal of the oppositeterminal that has been received but not yet played.

Second, in the predetermined call mode, the voice signal of the oppositeterminal is received; and the received voice signal of the oppositeterminal is auto-played.

After entering the predetermined call mode, the on-site terminalreceives the voice signal of the opposite terminal, then the on-siteterminal auto-plays the received voice signal of the opposite terminalby using the handset.

As one possible way of implementation, the on-site terminal receivingthe voice signal of the opposite terminal in the predetermined call modeincludes following step.

If the on-site terminal receives a text information of the oppositeterminal, then the on-site terminal synthesizes a voice signal of theopposite terminal according to the text information of the oppositeterminal.

Namely, during a process of using the one-way voice call, the oppositeterminal may also send the text information to the on-site terminal Ifthe on-site terminal receives the text information of the oppositeterminal in the predetermined call mode, then the on-site terminalsynthesizes a voice signal of the opposite terminal according to thetext information of the opposite terminal and the voice synthesistechnology, and auto-plays the synthesized voice signal of the oppositeterminal by using the handset, without the user's other operations.

In step 305, after entering the predetermined call mode, whether thesensed distance value of the on-site terminal is greater than the presetthreshold is detected.

After entering the predetermined call mode, the on-site terminal detectswhether the sensed distance value is greater than the preset thresholdevery preset time interval.

In step 306, if it is detected that the sensed distance value is greaterthan the preset threshold, the predetermined call mode is exited.

If it is detected that the sensed distance value is smaller than thepreset threshold, the on-site terminal keeps in the predetermined callmode.

If it is detected that the sensed distance value is greater than thepreset threshold, the on-site terminal exits from the predetermined callmode and then enters a normal mobile call mode. In the normal mobilecall mode, if the on-site terminal receives the voice signal of theopposite terminal, the on-site terminal plays the voice signal of theopposite terminal only when receiving a trigger signal. In theconventional mobile call mode, only when the on-site terminal receives asignal of constantly pressing an intercom button, the on-site terminalcollects the voice signal of the on-site terminal and sends it to theopposite terminal.

It should be explained additionally that, the present embodiment alsoprovides a mechanism for buffering the voice signal of the oppositeterminal, as shown in FIG. 3B, for a second way of implementing step304, the following sub-steps may be further included.

In step 304 a, after receiving the voice signal of the oppositeterminal, whether the voice signal of the on-site terminal is beingcollected is detected.

After receiving the voice signal of the opposite terminal, the on-siteterminal may firstly detect whether the voice signal of the on-siteterminal is being collected, namely, whether the user is talking. If itis detected that the voice signal of the on-site terminal is beingcollected, step 304 b is entered; and if it is detected that the voicesignal of the on-site terminal is not being collected, step 304 e isentered.

In step 304 b, if it is detected that the voice signal of the on-siteterminal is being collected, the received voice signal of the oppositeterminal is buffered.

If the on-site terminal detects that the voice signal of the on-siteterminal is being collected, namely, the user is talking, the receivedvoice signal of the opposite terminal is buffered, and is not played forthe moment. The voice signal of the opposite terminal herein includesthe voice signal of the opposite terminal directly sent by the oppositeterminal, and also includes the voice signal of the opposite terminalsynthesized by the on-site terminal according to the text information ofthe opposite terminal.

In step 304 c, whether the collection of the voice signal of the on-siteterminal has been completed is detected.

After buffering the voice signal of the opposite terminal, the on-siteterminal detects whether the collection of the voice signal of theon-site terminal has been completed, namely, whether the user hasstopped talking every preset time interval.

In order to ensure the accuracy of detection, the on-site terminaldetects every preset time interval whether the collection of the voicesignal of the on-site terminal has been completed and whether a timelength after the collection has been completed reaches a preset timelength, namely, whether the user has stopped talking, and the time ofstopping talking exceeds the preset time length.

In step 304 d, if it is detected that the collection of the voice signalof the on-site terminal has been completed, the buffered voice signal ofthe opposite terminal is auto-played.

If it is detected that the collection of the voice signal of the on-siteterminal has been completed, the on-site terminal auto-plays thebuffered voice signal of the opposite terminal. If the number of thebuffered voice signals of the opposite terminal is more than one, theon-site terminal plays the voice signals of the opposite terminal one byone in an order of the buffer time from early to late.

In step 304 e, if it is detected the voice signal of the on-siteterminal is not being collected, the voice signal of the oppositeterminal is played directly.

If it is detected that the voice signal of the on-site terminal is notcollected in step 304 a, the on-site terminal plays the voice signal ofthe opposite terminal directly.

It should be explained additionally that, after the on-site terminalentering the predetermined call mode, the on-site terminal sends aprompt information for prompting that the on-site terminal has enteredthe predetermined call mode to the opposite terminal, and the oppositeterminal may receive and display the prompt information. Similarly, ifthe opposite terminal has entered the predetermined call mode, theon-site terminal receives the prompt information for prompting that theopposite terminal has entered the predetermined call mode sent from theopposite terminal, and then the on-site terminal displays the promptinformation.

In summary, in the method for voice calling provided by the presentembodiment, by entering the predetermined call mode when detecting thatthe sensed distance value is smaller than the preset threshold;auto-playing the voice signal of the opposite terminal in thepredetermined call mode; and when the voice signal of the on-siteterminal is detected in the predetermined call mode, collecting thevoice signal of the on-site terminal, and transmitting the voice signalof the on-site terminal to the opposite terminal, the problem that theuser can not smoothly use the network interphone when it is inconvenientfor the user to perform a press operation or a click operation on theterminal is solved, the effects that voice playing and voice collectingof the network interphone can be automatically triggered as long as theuser puts the terminal close to his ear, and the network interphone canbe used without any other operation are achieved, and a user experienceof an approximate two-way conversation can be obtained.

In the method for voice calling provided by the present embodiment, bysynthesizing the received text information of the opposite terminal intothe voice signal of the opposite terminal and playing it, even theopposite terminal sends the text information, the user can still obtaina user experience of an approximate two-way conversation, and the wholeprocess does not require the user's other operations, thus improvingcommunication efficiency.

In the method for voice calling provided by the present embodiment, bybuffering the voice signal of the opposite terminal and/or thesynthesized voice signal of the opposite terminal when collecting thevoice signal of the on-site terminal; and playing the buffered voicesignal of the opposite terminal only after the voice signal of theon-site terminal has been collected completely, the user may not bedisturbed by any voice from the opposite side when talking, and it isalso ensured that the present embodiment is still a one-way conversationother than a two-way conversation, which complies with the requirementsto the operating agencies.

In one example, a user A and a user B use the clients in the terminalsto perform a network intercom. The user A and the user B both put theterminal on a position close to their ears, thus the terminal A used bythe user A and the terminal B used by the user B both enter thepredetermined call mode. At this time, if the user A is talking, theterminal A may automatically collect the voice signal, and send it tothe terminal B. After receiving the voice signal, the terminal B mayauto-play the voice signal in a handset. If the user B is talking, theterminal B may automatically collect the voice signal, and send it tothe terminal A. After receiving the voice signal, the terminal Aauto-plays the voice signal in a handset. In this way, an effect ofachieving an approximate two-way conversation by using a one-wayconversation function is realized. If the terminal B sends a voicesignal to the terminal A when the user A is talking, the terminal A maybuffer the voice signal, and auto-play the voice signal after the user Ahas finished the talking, thus the conversation between the user A andthe user B is not disturbed.

In another example, a user A and a user B use the clients in theterminals to perform a network intercom. The user A puts a terminal A ona position close to his ear, thus the terminal A used by the user A willenter the predetermined call mode; while the user B still uses aterminal B in a normal mode. At this time, if the user A is talking, theterminal A may automatically collect the voice signal, and send it tothe terminal B. After receiving the voice signal, the terminal B firstlydisplays there is one piece of unread voice, then plays the voice signalonly after receiving a clicking signal for the unread voice from theuser B. If the user B is talking, the user B needs to press a prescribedphysical button or a prescribed virtual key on the terminal B, then theterminal B may collect the voice signal of the user B, and send it tothe terminal A. After receiving the voice signal, the terminal Aauto-plays the voice signal in the handset. If the user B isinconvenient to talk, the user B may send a text information to theterminal A via the terminal B, the terminal A synthesizes the receivedtext information into the voice signal, and then auto-plays the voicesignal in the handset, thus the mixed input of text and voice may beachieved in the user B, and the user A's experience of an approximatetwo-way conversation may be realized.

FIG. 4 is a flow chart illustrating the method for voice playingaccording to an exemplary embodiment. This embodiment is illustrated byapplying the method for voice playing to the on-site terminal in FIG. 1.The method for voice playing includes the following steps.

In step 401, whether a sensed distance value of an on-site terminal issmaller than a preset threshold is detected.

In step 402, if it is detected that the sensed distance value is smallerthan the preset threshold, a predetermined play mode is entered.

In step 403, in the predetermined play mode, voice signal of an oppositeterminal is auto-played.

In summary, in the method for voice playing provided by the presentembodiment, by entering the predetermined call mode when detecting thatthe sensed distance value is smaller than the preset threshold; andauto-playing the voice signal of the opposite terminal in thepredetermined call mode, the problem that the user can not smoothly usethe network interphone when it is inconvenient for the user to perform apress operation or a click operation on the terminal is solved, and theeffect that voice playing of the network interphone can be automaticallytriggered as long as the user puts the terminal close to his ear isachieved.

FIG. 5 is a flow chart illustrating the method for voice playingaccording to an exemplary embodiment. This embodiment is illustrated byapplying the method for voice playing to the on-site terminal in FIG. 1.The method for voice playing includes the following steps.

In step 501, whether a sensed distance value of an on-site terminal issmaller than a preset threshold is detected.

The on-site terminal obtains the sensed distance value in front of theterminal through a built-in distance sensor. The distance sensor isusually provided on a location at the top of the screen of the on-siteterminal and near a handset.

When a client with a one-way voice call function in the on-site terminalis operating, the on-site terminal detects whether the sensed distancevalue is smaller than the preset threshold every preset time interval.The preset threshold may be set by research personnel in accordance withexperimental values or empirical values. For example, if the on-siteterminal is a mobile phone, the research personnel may obtain the presetthreshold via statistical analysis by repeatedly collecting the senseddistance values between the mobile phone and the user's ear during thephone call.

In step 502, if it is detected that the sensed distance value is smallerthan the preset threshold, the predetermined play mode is entered.

If the on-site terminal detects that the sensed distance value issmaller than the preset threshold, the on-site terminal is considered tobe put on a location close to the user's ear, and the on-site terminalenters the predetermined play mode.

In step 503, in the predetermined play mode, the voice signal of theopposite terminal is auto-played.

In the predetermined play mode, the on-site terminal directly uses thehandset to play the voice signal of the opposite terminal, without theuser's other operations. The voice signal of the opposite terminal is anetwork voice signal sent by the opposite terminal by using the one-wayvoice call function, other than a voice signal sent by using the two-wayvoice call function or a voice signal sent by using the conventionalmobile call.

There are two ways to implement this step.

First, in the predetermined play mode, the voice signal of the oppositeterminal that has been received but not yet played is auto-played.

Namely, before entering the predetermined play mode, the on-siteterminal has received the voice signal of the opposite terminal, anddisplayed it as an unread voice signal of the opposite terminalThereafter, the on-site terminal enters the predetermined play mode, andauto-plays the voice signal of the opposite terminal that has beenreceived but not yet played.

Second, in the predetermined play mode, the voice signal of the oppositeterminal is received; and the received voice signal of the oppositeterminal is auto-played.

After entering the predetermined play mode, the on-site terminalreceives the voice signal of the opposite terminal, then the on-siteterminal auto-plays the received voice signal of the opposite terminalby using the handset.

As one possible way of implementation, the on-site terminal receivingthe voice signal of the opposite terminal in the predetermined play modeincludes following step.

If the on-site terminal receives a text information of the oppositeterminal, then the on-site terminal synthesizes a voice signal of theopposite terminal according to the text information of the oppositeterminal.

Namely, during a process of using the one-way voice call, the oppositeterminal may also send the text information to the on-site terminal Ifthe on-site terminal receives the text information of the oppositeterminal in the predetermined play mode, then the on-site terminalsynthesizes the voice signal of the opposite terminal according to thetext information of the opposite terminal and the voice synthesistechnology, and auto-plays the synthesized voice signal of the oppositeterminal by using the handset, without the user's other operations.

In step 504, after entering the predetermined play mode, whether thesensed distance value of the on-site terminal is greater than the presetthreshold is detected.

After entering the predetermined play mode, the on-site terminal detectswhether the sensed distance value is greater than the preset thresholdevery preset time interval.

In step 505, if it is detected that the sensed distance value is greaterthan the preset threshold, the predetermined play mode is exited.

If it is detected that the sensed distance value is smaller than thepreset threshold, the on-site terminal keeps in the predetermined playmode.

If it is detected that the sensed distance value is greater than thepreset threshold, the on-site terminal exits from the predetermined playmode and then enters a normal mobile play mode. In the normal mobileplay mode, if the on-site terminal receives the voice signal of theopposite terminal, the on-site terminal plays the voice signal of theopposite terminal only when receiving a trigger signal.

In summary, in the method for voice playing provided by the presentembodiment, by entering the predetermined call mode when detecting thatthe sensed distance value is smaller than the preset threshold; andauto-playing the voice signal of the opposite terminal in thepredetermined play mode, the problem that the user can not smoothly usethe network interphone when it is inconvenient for the user to perform apress operation or a click operation on the terminal is solved, and theeffect that voice playing of the network interphone can be automaticallytriggered as long as the user puts the terminal close to his ear isachieved.

In the method for voice playing provided by the present embodiment, bygenerating the received text information of the opposite terminal intothe synthesized voice signal and playing it, even the opposite terminalsends the text information, the user still can obtain a user experienceof an approximate two-way conversation, and the whole process does notrequire the user's other operations, thus improving communicationefficiency.

The following are device embodiments of the present disclosure, whichmay be configured to perform the method embodiments of the presentdisclosure. For the details not disclosed in the device embodiments ofthe present disclosure, the method embodiments of the present disclosuremay be referred to.

FIG. 6 is a block diagram illustrating the device for voice callingaccording to an exemplary embodiment. The device for voice calling maybe implemented as a whole or a part of a terminal by using software,hardware or by combining both software and hardware. The device forvoice calling includes: a distance detecting module 620, a mode enteringmodule 640, an auto-playing module 660 and an auto-collecting module680.

The distance detecting module 620 is configured to detect whether asensed distance value of an on-site terminal is smaller than a presetthreshold.

The mode entering module 640 is configured to enter a predetermined callmode when the distance detecting module 620 detects that the senseddistance value is smaller than the preset threshold.

The auto-playing module 660 is configured to auto-play a voice signal ofan opposite terminal in the predetermined call mode.

The auto-collecting module 680 is configured to, when a voice signal ofthe on-site terminal is detected in the predetermined call mode, collectthe voice signal of the on-site terminal and transmit the voice signalof the on-site terminal to the opposite terminal.

In summary, in the device for voice calling provided by the presentembodiment, by entering the predetermined call mode when detecting thatthe sensed distance value is smaller than the preset threshold;auto-playing the voice signal of the opposite terminal in thepredetermined call mode; and when the voice signal of the on-siteterminal is detected in the predetermined call mode, collecting thevoice signal of the on-site terminal, and transmitting the voice signalof the on-site terminal to the opposite terminal, the problem that theuser can not smoothly use the network interphone when it is inconvenientfor the user to perform a press operation or a click operation on theterminal is solved, the effects that voice playing and voice collectingof the network interphone can be automatically triggered as long as theuser puts the terminal close to his ear, and the network interphone canbe used without any other operation are achieved, and a user experienceof an approximate two-way conversation can be obtained.

FIG. 7 is a diagram illustrating the device for voice calling accordingto another exemplary embodiment. The device for voice calling may beimplemented as a whole or a part of a terminal by using software,hardware or by combining both software and hardware. The device forvoice calling includes: a distance detecting module 620, a mode enteringmodule 640, an auto-playing module 660 and an auto-collecting module680.

The distance detecting module 620 is configured to detect whether asensed distance value in front of a terminal of an on-site terminal issmaller than a preset threshold.

The mode entering module 640 is configured to enter a predetermined callmode when the distance detecting module 620 detects that the senseddistance value is smaller than the preset threshold.

The auto-playing module 660 is configured to auto-play a voice signal ofan opposite terminal in the predetermined call mode.

The auto-collecting module 680 is configured to, when a voice signal ofthe on-site terminal is detected in the predetermined call mode, collectthe voice signal of the on-site terminal, and transmit the voice signalof the on-site terminal to the opposite terminal.

The auto-playing module 660 includes:

a buffer playing unit 662 configured to auto-play the voice signal ofthe opposite terminal that has been received but not yet played in thepredetermined call mode;

or,

a voice receiving unit 664 configured to receive the voice signal of theopposite terminal in the predetermined call mode; and an auto-playingunit 666 configured to auto-play the received voice signal of theopposite terminal.

The auto-playing unit 666 includes: a collection detecting sub-unit, asignal buffering sub-unit, a completion detecting sub-unit and a bufferplaying sub-unit.

The collection detecting sub-unit is configured to detect whether theauto-collecting module is collecting the voice signal of the on-siteterminal after receiving the voice signal of the opposite terminal.

The signal buffering sub-unit is configured to buffer the voice signalof the opposite terminal when the collection detecting sub-unit detectsthat the voice signal of the on-site terminal is being collected.

The completion detecting sub-unit is configured to detect whether thevoice signal of the on-site terminal has been collected completely. Thecompletion detecting sub-unit is configured to detect whether the voicesignal of the on-site terminal has been collected completely and whethera time length elapsed after the collection has been completed reaches apreset time length.

The buffer playing sub-unit is configured to auto-play the bufferedvoice signal of the opposite terminal when the completion detectingsub-unit detects that the voice signal of the on-site terminal has beencollected completely.

The voice receiving unit 664 includes: a text receiving sub-unit and avoice synthesizing sub-unit.

The text receiving sub-unit is configured to receive a text informationof the opposite terminal in the predetermined call mode.

The voice synthesizing sub-unit is configured to synthesize a voicesignal of the opposite terminal according to the text information of theopposite terminal.

The auto-collecting module 680 is configured to, when the auto-playingmodule 660 is playing the voice signal of the opposite terminal and thevoice signal of the on-site terminal is detected, collect the voicesignal of the on-site terminal, and transmit the voice signal of theon-site terminal to the opposite terminal.

The device further includes: a re-detecting module 692 and a modeexiting module 694.

The re-detecting module 692 is configured to detect whether a senseddistance value of the on-site terminal is greater than the presetthreshold after the mode entering module 640 enters the predeterminedcall mode.

The mode exiting module 694 is configured to exit from the predeterminedcall mode when the re-detecting module 692 detects that the senseddistance value is greater than the preset threshold.

In summary, in the device for voice calling provided by the presentembodiment, by entering the predetermined call mode when detecting thatthe sensed distance value is smaller than the preset threshold;auto-playing the voice signal of the opposite terminal in thepredetermined call mode; and when the voice signal of the on-siteterminal is detected in the predetermined call mode, collecting thevoice signal of the on-site terminal, and transmitting the voice signalof the on-site terminal to the opposite terminal, the problem that theuser can not smoothly use the network interphone when it is inconvenientfor the user to perform a press operation or a click operation on theterminal is solved, the effects that voice playing and voice collectingof the network interphone can be automatically triggered as long as theuser puts the terminal close to his ear, and the network interphone canbe used without any other operation are achieved, and a user experienceof an approximate two-way conversation can be obtained.

In the device for voice calling provided by the present embodiment, bysynthesizing the received text information of the opposite terminal intothe voice signal of the opposite terminal and playing it, even theopposite terminal sends the text information, it still enable a user toobtain a user experience of an approximate two-way conversation, and thewhole process does not require the user's other operations, thusimproving communication efficiency.

In the device for voice calling provided by the present embodiment, bybuffering the voice signal of the opposite terminal and/or thesynthesized voice signal of the opposite terminal when collecting thevoice signal of the on-site terminal; and playing the buffered voicesignal of the opposite terminal only after the voice signal of theon-site terminal has been collected completely, the user may not bedisturbed by any voice of other side when talking, and it is alsoensured that the present embodiment is still a one-way conversationother than a two-way conversation, which may comply with therequirements of the operating agencies.

FIG. 8 is a block diagram illustrating the device for voice playingaccording to an exemplary embodiment. The device for voice playing maybe implemented as a whole or a part of a terminal by using software,hardware or by combining both software and hardware. The device forvoice playing includes:

a distance detecting module 820 configured to detect whether a senseddistance value of an on-site terminal is smaller than a presetthreshold;

a play mode module 840 configured to enter a predetermined play modewhen the distance detecting module 820 detects that the sensed distancevalue is smaller than the preset threshold; and

an auto-playing module 860 configured to auto-play the voice signal ofthe opposite terminal in the predetermined play mode.

In summary, in the device for voice playing provided by the presentembodiment, by entering the predetermined play mode when it is detectedthat the sensed distance value is smaller than the preset threshold, andauto-playing the voice signal of the opposite terminal in thepredetermined play mode, the problem that the user can not smoothly usethe network interphone when it is inconvenient for the user to perform apress operation or a click operation on the terminal is solved, and theeffect that voice playing of the network interphone can be automaticallytriggered as long as the user puts the terminal close to his ear isachieved.

FIG. 9 is a block diagram illustrating the device for voice playingaccording to another exemplary embodiment. The device for voice playingmay be implemented as a whole or a part of a terminal by using software,hardware or by combining both software and hardware. The device forvoice playing includes:

a distance detecting module 820 configured to detect whether a senseddistance value of an on-site terminal is smaller than a presetthreshold;

a play mode module 840 configured to enter a predetermined play modewhen the distance detecting module 820 detects that the sensed distancevalue is smaller than the preset threshold; and

an auto-playing module 860 configured to auto-play a voice signal of anopposite terminal in the predetermined play mode.

The auto-playing module 860 includes:

a buffer playing unit 862 configured to auto-play the voice signal ofthe opposite terminal that has been received but not yet played in thepredetermined play mode; or,

a voice receiving unit 864 configured to receive the voice signal of theopposite terminal in the predetermined play mode; and an auto-playingunit 866 configured to auto-play the received voice signal of theopposite terminal.

The voice receiving unit 864 includes: a text receiving sub-unit and avoice synthesizing sub-unit.

The text receiving sub-unit is configured to receive a text informationof the opposite terminal in the predetermined play mode.

The voice synthesizing sub-unit is configured to synthesize a voicesignal of the opposite terminal according to the text information of theopposite terminal.

The device further includes: a re-detecting module 882 and a playexiting module 884.

The re-detecting module 882 is configured to detect whether the senseddistance value of the on-site terminal is greater than the presetthreshold after the play mode module 840 enters the predetermined playmode.

The play exiting module 884 is configured to exit from the predeterminedplay mode when the re-detecting module 882 detects that the senseddistance value is greater than the preset threshold.

In summary, in the device for voice playing provided by the presentembodiment, by entering the predetermined play mode when detecting thatthe sensed distance value is smaller than the preset threshold, andauto-playing the voice signal of the opposite terminal when receivingthe voice signal of the opposite terminal in the predetermined playmode, the problem that the user can not smoothly use the networkinterphone when it is inconvenient for the user to perform a pressoperation or a click operation on the terminal is solved, and the effectthat voice playing of the network interphone can be automaticallytriggered as long as the user puts the terminal close to his ear isachieved.

In the device for voice playing provided by the present embodiment, bysynthesizing the received text information of the opposite terminal intothe voice signal of the opposite terminal so as to play it, even theopposite terminal sends the text information, it still enable a user toobtain a user experience of an approximate two-way conversation, and thewhole process does not require the user's other operations, thusimproving communication efficiency.

In the devices in the above embodiments, the specific mode of performingoperation for each module has been described in detail in the methodembodiments, which will not be described herein.

FIG. 10 is a block diagram illustrating a terminal according to anexemplary embodiment of the present disclosure. The terminal 1000 mayinclude one or more of the following components: a processing component1002, a memory 1004, a power component 1006, a multimedia component1008, an audio component 1010, an input/output (I/O) interface 1012, asensor component 1014, and a communication component 1016.

The processing component 1002 typically controls overall operations ofthe terminal 1000, such as the operations associated with display,telephone calls, data communications, camera operations, and recordingoperations. The processing component 1002 may include one or moreprocessors 1020 to execute instructions to perform all or part of thesteps in the above described methods. Moreover, the processing component1002 may include one or more modules which facilitate the interactionbetween the processing component 1002 and other components. Forinstance, the processing component 1002 may include a multimedia moduleto facilitate the interaction between the multimedia component 1008 andthe processing component 1002.

The memory 1004 is configured to store various types of data to supportthe operation of the terminal 1000. Examples of such data includeinstructions for any applications or methods operated on the terminal1000, contact data, phonebook data, messages, pictures, video, etc. Thememory 1004 may be implemented using any type of volatile ornon-volatile memory devices, or a combination thereof, such as a staticrandom access memory (SRAM), an electrically erasable programmableread-only memory (EEPROM), an erasable programmable read-only memory(EPROM), a programmable read-only memory (PROM), a read-only memory(ROM), a magnetic memory, a flash memory, a magnetic or an optical disk.

The power component 1006 provides power to various components of theterminal 1000. The power component 1006 may include a power managementsystem, one or more power sources, and any other components associatedwith the generation, management, and distribution of power for theterminal 1000.

The multimedia component 1008 includes a screen providing an outputinterface between the terminal 1000 and the user. In some embodiments,the screen may include a liquid crystal display (LCD) and a touch panel(TP). If the screen includes the touch panel, the screen may beimplemented as a touch screen to receive input signals from the user.The touch panel includes one or more touch sensors to sense touches,swipes, and gestures on the touch panel. The touch sensors may not onlysense a boundary of a touch or swipe action, but also sense a period oftime and a pressure associated with the touch or swipe action. In someembodiments, the multimedia component 1008 includes a front cameraand/or a rear camera. The front camera and/or the rear camera mayreceive an external multimedia datum while the terminal 1000 is in anoperation mode, such as a photographing mode or a video mode. Each ofthe front camera and the rear camera may be a fixed optical lens systemor have focus and optical zoom capability.

The audio component 1010 is configured to output and/or input audiosignals. For example, the audio component 1010 includes a microphone(“MIC”) configured to receive an external audio signal when the terminal1000 is in an operation mode, such as a call mode, a recording mode, anda voice recognition mode. The received audio signal may be furtherstored in the memory 1004 or transmitted via the communication component1016. In some embodiments, the audio component 1010 further includes aspeaker to output audio signals.

The I/O interface 1012 provides an interface between the processingcomponent 1002 and peripheral interface modules, such as a keyboard, aclick wheel, buttons, and the like. The buttons may include, but are notlimited to, a home button, a volume button, a starting button, and alocking button.

The sensor component 1014 includes one or more sensors to provide statusassessments of various aspects of the terminal 1000. For instance, thesensor component 1014 may detect an open/closed status of the terminal1000, relative positioning of components, e.g., the display and thekeypad, of the terminal 1000, a change in position of the terminal 1000or a component of the terminal 1000, a presence or absence of usercontact with the terminal 1000, an orientation or anacceleration/deceleration of the terminal 1000, and a change intemperature of the terminal 1000. The sensor component 1014 may includea proximity sensor configured to detect the presence of nearby objectswithout any physical contact. The sensor component 1014 may also includea light sensor, such as a CMOS or CCD image sensor, for use in imagingapplications. In some embodiments, the sensor component 1014 may alsoinclude an accelerometer sensor, a gyroscope sensor, a magnetic sensor,a pressure sensor, or a temperature sensor.

The communication component 1016 is configured to facilitatecommunication, wired or wirelessly, between the terminal 1000 and otherdevices. The terminal 1000 can access a wireless network based on acommunication standard, such as WiFi, 2G, or 3G, or a combinationthereof In one exemplary embodiment, the communication component 1016receives a broadcast signal or broadcast associated information from anexternal broadcast management system via a broadcast channel. In oneexemplary embodiment, the communication component 1016 further includesa near field communication (NFC) module to facilitate short-rangecommunications. For example, the NFC module may be implemented based ona radio frequency identification (RFID) technology, an infrared dataassociation (IrDA) technology, an ultra-wideband (UWB) technology, aBluetooth (BT) technology, and other technologies.

In exemplary embodiments, the terminal 1000 may be implemented with oneor more application specific integrated circuits (ASICs), digital signalprocessors (DSPs), digital signal processing devices (DSPDs),programmable logic devices (PLDs), field programmable gate arrays(FPGAs), controllers, micro-controllers, microprocessors, or otherelectronic components, for performing the above described methods.

In exemplary embodiments, there is also provided a non-transitorycomputer-readable storage medium including instructions, such asincluded in the memory 1004, executable by the processor 1020 in theterminal 1000, for performing the above-described methods. For example,the non-transitory computer-readable storage medium may be a ROM, a RAM,a CD-ROM, a magnetic tape, a floppy disc, an optical data storagedevice, and the like.

There is a non-transitory computer-readable storage medium, when theinstructions in the storage medium are executed by a processor of theterminal, the terminal is enabled to perform the methods for voicecalling as shown in FIG. 2 or FIG. 3, or perform the methods for voiceplaying as shown in FIG. 4 or FIG. 5.

Other embodiments of the invention will be apparent to those skilled inthe art from consideration of the specification and practice of theinvention disclosed here. This application is intended to cover anyvariations, uses, or adaptations of the invention following the generalprinciples thereof and including such departures from the presentdisclosure as come within known or customary practice in the art. It isintended that the specification and examples be considered as exemplaryonly, with a true scope and spirit of the invention being indicated bythe following claims.

It will be appreciated that the present invention is not limited to theexact construction that has been described above and illustrated in theaccompanying drawings, and that various modifications and changes can bemade without departing from the scope thereof. It is intended that thescope of the invention only be limited by the appended claims.

1-26. (canceled)
 27. A method for voice processing, comprising:detecting whether a sensed distance value of an on-site terminal is lessthan a preset threshold; entering a predetermined mode if the senseddistance value is less than the preset threshold; and auto-playing avoice signal of an opposite terminal in the predetermined mode.
 28. Themethod of claim 27, further comprising: if a voice signal of the on-siteterminal is detected in the predetermined mode, collecting the voicesignal of the on-site terminal, and transmitting the voice signal of theon-site terminal to the opposite terminal.
 29. The method of claim 27,wherein the auto-playing the voice signal of the opposite terminal inthe predetermined mode comprises: in the predetermined mode,auto-playing the voice signal of the opposite terminal that has beenreceived but not yet played; or, in the predetermined mode, receivingthe voice signal of the opposite terminal, and auto-playing the receivedvoice signal of the opposite terminal.
 30. The method of claim 29,wherein the auto-playing the received voice signal of the oppositeterminal comprises: after receiving the voice signal of the oppositeterminal, detecting whether the voice signal of the on-site terminal isbeing collected; if the voice signal of the on-site terminal is beingcollected, buffering the voice signal of the opposite terminal;detecting whether the voice signal of the on-site terminal has beencollected completely; and if the voice signal of the on-site terminalhas been collected completely, auto-playing the buffered voice signal ofthe opposite terminal.
 31. The method of claim 30, wherein the detectingwhether the voice signal of the on-site terminal has been collectedcompletely comprises: detecting whether a preset length of time haselapsed after the collection has been completed.
 32. The method of claim29, wherein the receiving the voice signal of the opposite terminal inthe predetermined mode comprises: receiving text information of theopposite terminal in the predetermined mode; and synthesizing a voicesignal of the opposite terminal according to the text information of theopposite terminal.
 33. The method of claim 28, wherein the collectingthe voice signal of the on-site terminal, and transmitting the voicesignal of the on-site terminal to the opposite terminal when the voicesignal of the on-site terminal is detected in the predetermined mode,comprises: if the voice signal of the on-site terminal is detected whendisplaying the voice signal of the opposite terminal, collecting thevoice signal of the on-site terminal, and transmitting the voice signalof the on-site terminal to the opposite terminal.
 34. The method ofclaim 28, further comprising: receiving a prompt information sent fromthe opposite terminal, the prompt information prompting that theopposite terminal has entered the predetermined mode; and displaying theprompt information.
 35. The method of claim 27, further comprising:after entering the predetermined mode, detecting whether the senseddistance value of the on-site terminal is greater than the presetthreshold; and if the sensed distance value of the on-site terminal isgreater than the preset threshold, exiting from the predetermined mode.36. A device for voice calling, comprising: a processor; a memory forstoring instructions executable by the processor; wherein the processoris configured to: detect whether a sensed distance value of an on-siteterminal is less than a preset threshold; enter a predetermined callmode if the sensed distance value is less than the preset threshold;auto-play a voice signal of an opposite terminal in the predeterminedcall mode; and if a voice signal of the on-site terminal is detected inthe predetermined call mode, collect the voice signal of the on-siteterminal, and transmit the voice signal of the on-site terminal to theopposite terminal.
 37. A device for voice playing, comprising: aprocessor; a memory for storing instructions executable by theprocessor; wherein the processor is configured to: detect whether asensed distance value of an on-site terminal is less than a presetthreshold; enter a predetermined play mode if the sensed distance valueis less than the preset threshold; and auto-play a voice signal of anopposite terminal in the predetermined play mode.
 38. A non-transitorycomputer readable storage medium comprising instructions, executable bya processor in a terminal, for performing a method for voice calling,the method comprising: detecting whether a sensed distance value of theterminal is less than a preset threshold; entering a predetermined callmode if the sensed distance value is less than the preset threshold;auto-playing a voice signal of an opposite terminal in the predeterminedcall mode; and if a voice signal of the terminal is detected in thepredetermined call mode, collecting the voice signal of the terminal,and transmitting the voice signal of the terminal to the oppositeterminal.
 39. A non-transitory computer readable storage mediumcomprising instructions, executable by a processor in a terminal, forperforming a method for voice calling, the method comprising: detectingwhether a sensed distance value of the terminal is less than a presetthreshold; if the sensed distance value is less than the presetthreshold, entering a predetermined play mode; and auto-playing a voicesignal of an opposite terminal in the predetermined play mode.